Picture for Hu Cao

Hu Cao

AffordanceGrasp-R1:Leveraging Reasoning-Based Affordance Segmentation with Reinforcement Learning for Robotic Grasping

Add code
Feb 03, 2026
Viaarxiv icon

Language-Guided Grasp Detection with Coarse-to-Fine Learning for Robotic Manipulation

Add code
Dec 24, 2025
Viaarxiv icon

TUMTraf EMOT: Event-Based Multi-Object Tracking Dataset and Baseline for Traffic Scenarios

Add code
Dec 20, 2025
Viaarxiv icon

BiSeg-SAM: Weakly-Supervised Post-Processing Framework for Boosting Binary Segmentation in Segment Anything Models

Add code
Apr 02, 2025
Figure 1 for BiSeg-SAM: Weakly-Supervised Post-Processing Framework for Boosting Binary Segmentation in Segment Anything Models
Figure 2 for BiSeg-SAM: Weakly-Supervised Post-Processing Framework for Boosting Binary Segmentation in Segment Anything Models
Figure 3 for BiSeg-SAM: Weakly-Supervised Post-Processing Framework for Boosting Binary Segmentation in Segment Anything Models
Figure 4 for BiSeg-SAM: Weakly-Supervised Post-Processing Framework for Boosting Binary Segmentation in Segment Anything Models
Viaarxiv icon

Towards Vision Zero: The Accid3nD Dataset

Add code
Mar 15, 2025
Figure 1 for Towards Vision Zero: The Accid3nD Dataset
Figure 2 for Towards Vision Zero: The Accid3nD Dataset
Figure 3 for Towards Vision Zero: The Accid3nD Dataset
Figure 4 for Towards Vision Zero: The Accid3nD Dataset
Viaarxiv icon

CoDa-4DGS: Dynamic Gaussian Splatting with Context and Deformation Awareness for Autonomous Driving

Add code
Mar 09, 2025
Figure 1 for CoDa-4DGS: Dynamic Gaussian Splatting with Context and Deformation Awareness for Autonomous Driving
Figure 2 for CoDa-4DGS: Dynamic Gaussian Splatting with Context and Deformation Awareness for Autonomous Driving
Figure 3 for CoDa-4DGS: Dynamic Gaussian Splatting with Context and Deformation Awareness for Autonomous Driving
Figure 4 for CoDa-4DGS: Dynamic Gaussian Splatting with Context and Deformation Awareness for Autonomous Driving
Viaarxiv icon

TUMTraffic-VideoQA: A Benchmark for Unified Spatio-Temporal Video Understanding in Traffic Scenes

Add code
Feb 04, 2025
Figure 1 for TUMTraffic-VideoQA: A Benchmark for Unified Spatio-Temporal Video Understanding in Traffic Scenes
Figure 2 for TUMTraffic-VideoQA: A Benchmark for Unified Spatio-Temporal Video Understanding in Traffic Scenes
Figure 3 for TUMTraffic-VideoQA: A Benchmark for Unified Spatio-Temporal Video Understanding in Traffic Scenes
Figure 4 for TUMTraffic-VideoQA: A Benchmark for Unified Spatio-Temporal Video Understanding in Traffic Scenes
Viaarxiv icon

UniLoc: Towards Universal Place Recognition Using Any Single Modality

Add code
Dec 16, 2024
Viaarxiv icon

Dataset Distillation by Automatic Training Trajectories

Add code
Jul 19, 2024
Figure 1 for Dataset Distillation by Automatic Training Trajectories
Figure 2 for Dataset Distillation by Automatic Training Trajectories
Figure 3 for Dataset Distillation by Automatic Training Trajectories
Figure 4 for Dataset Distillation by Automatic Training Trajectories
Viaarxiv icon

Embracing Events and Frames with Hierarchical Feature Refinement Network for Object Detection

Add code
Jul 17, 2024
Figure 1 for Embracing Events and Frames with Hierarchical Feature Refinement Network for Object Detection
Figure 2 for Embracing Events and Frames with Hierarchical Feature Refinement Network for Object Detection
Figure 3 for Embracing Events and Frames with Hierarchical Feature Refinement Network for Object Detection
Figure 4 for Embracing Events and Frames with Hierarchical Feature Refinement Network for Object Detection
Viaarxiv icon